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5 ; : Animal Integration Vector AND Methods FOR Its Use 

\ 

; INTRODUCTION 

' Field of the Invention 
10 The field of this invention is nucleic acid vectors. 

1 Background of the Invention 

1 pelivery of nucleic acids into a cell in culture or a whole animal is a process 
.known as transformation. Transformation plays a major role in a variety of biotechnology 
and related applications, including research, synthetic and therapeutic applications. 

1 5 ['Research applications where transformation in cultured cells plays a critical role include 
;the production of recombinant bacteria and yeast for cloning applications. Likewise, the 
jproduction of transgenic animals has provided invaluable knowledge into gene function, 
jcspeciaily C. elegans and D. melanogaster. Synthetic applications in which transformation 
•plays a critical role include the production of peptides and proteins, where both animal 

20 ^and cellular systems are employed. Therapeutic applications in which transformation 
jplays a key role include gene therapy applications, where both cellular and animal 
systems are employed. Because of the prevalent role transformation plays in the above 
md othier applications, a variety of different transformation protocols have been 
developed. 

25 In many transformation applications, it is desirable to introduce the exogenous 

DNA in a manner such that it is incorporated into a target cell's genome. One means of 
providing for genome integration is to employ a vector that is capable of homologous 
recombination. This technique is limited due to low efficiency and is applicable for 
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cellular use only where tfee desired transformation event can be selected from a pool of 
lOOO's of possibilities. •. 

Another approach uses viral based transformation to integrate nucleic acids into 
the genome. Viral vectors are limited in their efficiency by the size of the nucleic acid to 
5 be delivered. Also^ these viruses are limited to the species and cell type mih which they 
can deliver nucleic acids.'These limitations are for both cells and whole animals, but 
whole animals have further limitations. Viruses are efficiently eliminated from an animal 
before they are able to deliver nufcleic acids. Further, if gene expression is desired from a 
delivered nucleic acid, this too can be problematic when using a virus to deliver into a 

10 whole animal. Many times, even if the virus successfully avoids the animal's immune 
response and delivers the: nucleic acid into the genome, cellular mechanisms shut off the 
gene expression due to intracellutar viral detection. Viral based vectors finding use 
include retroviral vectorsi e.g. Moloney murine leukemia viral based vectors. Other viral 
based vectors that find use include adenovirus derived vectors, HSV derived vectors, 

1 5 sindbis derived vectors, etc. While viral vectors provide for a number of advantages, their 

1 

use is not optimal in many situations. 

More effective transformation vectors are continually being produced, but these 
are primarily for cellular transformation, primarily because of the dramatic difference in 
complexity when operating in a cell versus a whole animal. Accordingly, transformation 
20 vectors that integrate nucleic acids into the genome for whole animal systems are rare. 

Therefore, there ii continued interest in the development of new transformation 
vectors. Of particular interest is the development of non-viral vectors that provide for 

stable integration of exogenous DNA into a cellular genome in an intact animal. 

•1 

Relevant Literature 

25 U.S. Patents of interest include: 6,291 ,243; 5,719,055 and 4,670,388. Other 

references of interest include: Bea^U et al, Drosophila P element transposase is a novel 

site-specific endonuclease, Gene^and Development, vol. 1 1, 2137-2151 (1997).; Ivies et 
al. (1996). "Identification of functional domains and evolution of Tcl-like transportable 
elements" Proc. Natl. Acad. Sci., vol. 93: 5008-5013; Ivies et al. (1997). "Molecular 

: 2 
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Reconstruction of Sleeping Beauty, a Tel -like Transposon from Fish, and its 
Transposition in Human Cells" Cell, vol. 91: 501-510; Luo et al. (1998). "Chromosomal 
transposition of a Tel /mariner-like element in mouse embryonic stem cells" Proc. Natl. 
Acad. Sci., vol. 95: 40769-10773; Rio et al. (1986). "Identification and Immunochemical 
5 Analysis oflBiologically Active Drosophila P Element Transposase" Cell, Vol. 44: 21-32; 
Rio et al. (1988). "Evidence for Drosophila P Element Transposase Activity in 
Mammalian: Cells and Yeast" J. Mol. Biol., vol. 200: 41 1-415; Schouten et al. (1998). 
"Transposoii Tel of the nematode Caenorhabditis elegans jumps in human cells" Nucleic 
Acid Research, vol. 26(12): 3013-3017; and Rio, D.C. "Regulation of Drosophilia P 
1 0 element trar|spositi6n", Trends in Genetics, Sep. 1 99 1 , vol. 7, No. 9, pp. 282-287. 

; \ SUMMARY OF THE INVENTION 
An animal integration vector and methods for its use in the insertion of an 
exogenous nucleic acid into the genome of a whole animal are provided. The vectors of 

1 5 the subject invention include a pair of P element transposase recognized insertion 
sequences, e.g. P element derived 31 base pair inverted repeats, flanking at least one 
transcriptionally active gene that is located sufficienty proximal to one of the P-feet to 
provide for the desired genome integration. In practicing the subject methods, a vector as 
described above carrying an exogenous or endogenous nucleic acid is introduced into a 

20 target anima[l under conditions sufficient for entry into the animal and its cells resulting in 
transposition of the exogenous nucleic acid from the vector into the animal genome, i.e., 
in the genome of at feast a portion of the cells making up the animal. The subject methods 
find use in a;variety of transformation applications, including research, polypeptide 
synthesis and therapeutic applications. 



25 



i BRIEF DESCRIPTION OF THE DRAWINGS 
FigS4 1 A to provide diagrams of the vectors recited in the Experimental 
Section, below. ^ 
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■ V DESCRIPTION OF THE SPECIFIC EMBODIMENTS 

' Animal integration vectors and methods for their use in the insertion of an 

exogenous nucleic acid into the genome of a whole animal, i.e., into the genomic DNA of 

5 at least a portion of the cells that make up the animal, are provided. The subject vectors 

includfe two P element transposase recognized insertion sequences, e.g. P element derived 

3 1 base pair inverted repeats, flanking at least one transcriptionally active gene that is 

locateiji in sufficient proximity to at least one of the P element transposase recognized 

insertion sequences to provide for the animal genome integration. In the subject methods, 

10 a vector according to the invention that may include an exogenous nucleic acid is 

administered to a whole animal under conditions sufficient for integration of the vector 

into the animal genome to occur. The subject methods find use in a variety of different 

applications, including gene therapy applications. In further describing the subject 

invention, the subject vectors will be described first, followed by a discussion of the 

1 5 methoiis of us&ig the subject vectors for transformation of a target cell. 
I 
t 

; Before the subject invention is described further, it is to be understood that the 
invention is not limited to the particular embodiments of the invention described below, 
as variations of the particular embodiments may be made and still fall within the scope of 

20 the appended claims. It is also to be understood that the terminology employed is for the 
purpose of describing particular embodiments, and is not intended to be limiting. Instead, 
the scope of the present invention will be established by the appended claims. In addition, 
many Modifications may be made to adapt a particular situation, material, composition of 
matter^ process, process step or steps, to the objective, spirit and scope of the present 

25 invention. All such modifications are intended to be within the scope of the claims made 
herein.' | 

i * 

'.In this specification and the appended claims, the singular forms "a," "an" and 
"the" include plural reference unless the context clearly dictates otherwise. Conversely, it 

I 

4 
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is contemplated that the dlaims may be so-drafted to exclude any optional element. This 
statement is intended to serve as antecedent basis for use of such exclusive terminology as 
"solely " "only" and the like in connection with the recitation of claim elements or by use 
of a "negative" limitation 

5 

Where a range of values is provided, it is understood that each intervening value, 
to the tenth of the unit of the lower limit unless the context clearly dictates otherwise, 

between the uppdr and lower limit of that range, and any other stated or intervening value 

I 

in that stated range, is encompassed within the invention. The upper and lower limits of 
1 0 these smaller ranjges may. independently be included in the smaller ranges, and are also 
encompassed within the invention, subject to any specifically excluded limit in the stated 
range. Where the stated range includes one or both of the limits, ranges excluding either 
or both of those included limits are also included in the invention. Also, it is contemplated 
that any optional' feature of the inventive variations described herein may be set forth and 
15 claimed independently, o; in combination with any one or more of the features described 
herein. • \ 

Unless defined otherwise, all technical and scientific terms used herein have the 
same meaning aS: commonly understood to one of ordinary skill in the art to which this 
20 invention belongs. Although any methods, devices and materials similar or equivalent to 
those described Herein can be used in the practice or testing of the invention, the preferred 
methods, devices and materials are now described. 

All existiYig subject matter mentioned herein {e.g., publications, patents, patent 
25 applications and hardware) is incorporated by reference herein in its entirety. The 
referenced items are provided solely for their disclosure prior to the filing date of the 
present application. Nothing herein is to be construed as an admission that the present 
invention is not entitled to antedate such material by virtue of prior invention. 
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VectofIs 

the vectors of the subject invention are P element containing vectors, i.e., P 
'elementcontaining transposon vectors, by which is meant that the vectors include at least 
5 the P element transposase recognized insertion sequences of the Drosophila P element. 
As suchj the subject vectors include a pair of the 3 1 base pair inverted repeat domain of 
jthe P element, or the functional equivalent thereof, i.e., a domain recognized by the P 
element encoded transposase. The 31 base pair inverted repeat (i.e. P foot) is disclosed in 
Beall et al., "Drosophila P-element transposase is a novel site-specific endonuclease," 

10 Genes dev (Aug 15, 1997)1 l(16j:2137-51. 

In the subject vectors, the pair of P element encoded transposase recognized 
insertion sequences (i.e. P feet) flank at least one transcriptionally active gene that is in 
Sipproximation to, i.e., is sufficientyly close to or sufficiently proximal to, at least one of 
the P fe^t so as to provide for the desired genomic integration. By at least one is meant 

15 bne or more, usually no more than five, and more usually no more than four, where the 
number of transcriptionally active genes in the vector is often one, two, or three, where 
pnly one: of the transcriptionally active genes need be sufficiently proximal to the P foot. 
By in approximation to, i.e., sufficiently close or proximal to, is meant that the 
transcriptionally active gene is located at a distance from one of the flanking P-feet that 

20 typically is less than about 7000 bp and often less than about 6,000; 5,000; 4,000; 3,000; 
ior 2,000 bp, where in many embodiments the distance separating the transcriptionally 
active gene from the P foot does not exceed about 1,000 bp. In certain embodiments, the 
exogenous nucleic acid that is inserted into the genome of a whole animal in the subject 
tnethod^l described in greater detail infra, is one of the transcriptionally active genes of 

25 the vectors. 

By transcriptionally active gene is meant a sequence that is at least capable of 
.binding the transcriptional factors that initiate the transcription of DNA sequences into 
RNA sequences under intracellular conditions, e.g., any requisite expression regulatory 
•elements that are required for expression in the intracellular environment. As such, the 

6 
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transcriptionally active genes of the subject vectors typically include a stretch of 
nucleotides or domain, i^e. expression module, that includes a coding sequence of 
nucleotides in operational combination, i.e. operably linked, with requisite trascriptional 
mediation or regulatory dement(s). Requisite transcriptional mediation elements that may 
5 be present in the expression module include promoters, enhancers, termination and 
polyadenylation signal elements, splicing signal elements, and the like. In many 
embodiments, the transcriptional active gene or expression module is also correctly 
referred to as an expression cassette. 

Preferably, the expressioft module includes transcription regulatory elements that 

1 0 provide for expression of the gene in a broad host range. A variety of such combinations 
are known, where specific transcription regulatory elements include: SV40 elements, as 
described in Dijkema et al., EMBO J. (1985) 4:761; transcription regulatory elements 
derived from the LTR of the Rous sarcoma virus, as described in Gorman et al, Proc. 
Nat'l Acad. Sci USA (1982) 79:6777; transcription regulatory elements derived from the 

1 5 LTR of human cytomegalovirus (CMV), as described in Boshart et al.. Cell (1 985) 

41 :521 ; hsp70 promoters, (Levy^Holtzman ,R. and I. Schechter (Biochim. Biophys. Acta 
(1995) 1263: 96-98) Presnail, J.K. and M.A. Hoy, (Exp. Appl. Acarol. (1994) 18: 301- 
308)) and the like. ; 

In addition, the subject vectors typically include at least one restriction 

20 endonuclease recognized site, e.g., restriction site, located between the P feet which 

serves as a site for insertion of art exogenous nucleic acid. A variety of restriction-sites are 
known in the art and may be included into the vector, where such sites include those 
recognized by the following restriction enzymes: Hindlll, Pstl, Sail, Accl, Hindi, Xbal, 
BamYll, Smal, Xmal, Kpnl, Sad 5 EcoRl, and the like. In many embodiments, the vector 

25 includes a polylinker, i.e;, a closely arranged series or array of sites recognized by a 
plurality of different restriction enzymes, such as those listed above. 

The inter P feet dbmain of the vectors, i.e., that domain or region of the vector 
located or positioned between thf P feet which includes at least one transcriptionally 
active gene and the exogenous niicleic acid, when present, may vary greatly in size. 

7 
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Typically, the size of this inter P feet domain (i.e., P feet flanked domain) is at least about 
50 bp in length, usually at least about 1,000 bp in length and more usually at least about 
2,000 bp in length, Where the length of this domain may be as long as 150,000 bp or 
longer, but sometimes does not exceed about 50,000 bp in length and in certain 
5 embodiments does riot exceed about 20,000 bp in length. 

In certain enibodiments, the subject vectors further include a transposase encoding 
domain, i.e., .a region of nucleotides having a sequence that encodes a protein having 
transposase activity, particularly a transposase activity that recognizes the P feet of the P 
element, specifically a protein having the P element transposase activity, i.e., P element 

10 transposase or a functional equivalent or mimetic thereof The amino acid sequence of the 
P element transposase is disclosed in Rio et al.. Cell (January 17, 1986) 44: 21-32. A 
specific transposase encoding nucleic acid that may be present on the subject vectors is 
that found in pTURBO, where the sequence of this plasmid is disclosed in W.R. Engels, 
Bioess. 14: 681-686 (1992); and at the URL created by placing http:// in front of 

1 5 "firefly .bio.indiana: and .edu at then end of this term. When present on the subject 
vectors, this P elemejit transposase encoding region or domain is located outside the 
region flanked by the P feet. In other words, the transposase encoding region is located 
external to the regiori flanked by the P feet, i.e., outside the inter P-feet domain described 
supra. Put another w^y, the tranposase encoding region is positioned to the left of the left 

20 terminal P fopt or thd right of the right terminal P foot. 

The subject vectors can be used to stably insert a wide variety of endogenous 

s 

and/or exogenous nucleic acids into the genome of a whole animal (exogenous means a 
nucleic acid having a sequence that is not present in the target cell while endogenous 
means, a nucleic acid that pre-exists in the target cell, prior to insertion). By genome of a 
25 whole animal is meant that the nucleic acid is inserted or integrated into the genomic 

DNA of at least a poijion of the cells that make up the whole animal, e.g., at least about 5 
number %, often at lejast about 10 number % and sometimes at least about 25 number % 
or more, including at ieast about 30, 40, 50, 60, 70, 80, 90, 95, and 99 number % or more 
of the cells in; the whble animal or targeted organ(s) and/or tissue(s). In many 



8 
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embodimeiats, the individual cells of the animal into which the nucleic acid is integrated 
are distributed over a wide range of tissue types, e.g., at least about 5 different and often 
at least about 10 different tissue types, but may be targeted to one specific tissue or organ 
type. The mature of|he nucleic acid will vary depending the particular protocol being 
5 performed. ;For example, in research applications, the exogenous or endogenous nucleic 
acid may be a novel gene whose protein product is not well characterized. In such 
applications, the vector is employed to stably introduce the gene into the genome of the 
target whole animal and observe changes in the phenotype in order to characterize the 
gene. Alternatively^ in protein synthesis application, the exogenous or endogenous nucleic 

10 acid encodes a protlin of interest which is to be produced by the animal. In yet other 
embodiments wherb the vector is employed as a gene therapy vector, the exogenous or 
endogenous nucleic; acid is a gene having therapeutic activity, i.e. a gene that encodes a 
product of therapeutic utility. The nucleic acid may vary greatly in size. Generally, the 
size of the nucleic acid that is carried by the vector is at least about 50 bp, usually at least 

15 about lOOO^bp and more usually at least about 2000 bp, where the length may be as long 
as 150,000 bp or logger. In many embodiments, the exogenous nucleic acid is long, by 
which is m^ant that it ranges in length from about 30 to 50 kb. 

The| subject vectors may further comprise one or more elements required for 
amplification of the vector in a prokaryotic host, e.g. E. coli. Elements that may be 

20 included on the vector for use in amplification of the vector in a prokaryotic host include: 
an origin of replication, a selectable marker, and the like. 

Methods of Preparing the Subject Vectors 

» 

25 The'vectors of the subject invention may be produced by standard methods of 

restriction enzyme cjeavage, ligation and molecular cloning. One protocol for 
constructing the subject vectors includes the following steps. First, purified nucleic acid 
fragments containing desired component nucleotide sequences as well as extraneous 
sequences are cleaved with restriction endonucleases from initial sources, e.g., the 

^ 9 
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Drosophild? element. Fragments containing the desired nucleotide sequences are then 
separated frbm unwianted fragments of different size using conventional separation 
methods, e.g., by agarose gel electrophoresis. The desired fragments are excised from the 
gel and ligated together in the appropriate configuration so that a circular nucleic acid or 
5 plasmid containing the desired sequences, e.g. sequences corresponding to the various 
elements of the subject vectors, as described above is produced. Where desired, the 
circular molecules so constructed are then amplified in a prokaryotic host, e.g. E. coli. 
The procedvires of cleavage, plasmid construction, cell transformation and plasmid 
production involved in these steps are well known to one skilled in the art and the 

10 enzymes required fpr restriction and ligation are available commercially. (See, for 

example, R.; Wu, Ed., Methods in Enzymology, Vol. 68, Academic Press, N.Y. (1979); T, 
Maniatis, E; F. Fritsch and J. Sambrook, Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1982); Catalog 1982-83, 
New Englaiid Biolabs, Inc.; Catalog 1982-83, Bethesda Research Laboratories, Inc. An 

1 5 example of how to construct a vector of the present invention is provided in the 
Experimental section, infra, and depicted in Figure 2. 

Methods of Using the Subject Vectors 

I 

20 The Subject vectors find use in a variety of applications in which it is desired to 

introduce and stably integrate an exogenous or endogenous nucleic acid into the genome 
of a whole animal. As mentioned above, exogenous nucleic acid means a stretch of 
nucleotides that is not initially present in the target cell, while endogenous nucleic acid 
means a nucleic acid that pre-exists in the genome of the animal. In many embodiments, 

25 the sequence of nucleotides present in the exogenous nucleic acid will be one that is not 
found in thq genome of the animal. The subject methods can be used with a variety of 
animals. Whole animals with which the subject vectors may be employed are generally 
vertebrates,;where in many embodiments the animals are mammals. Of particular interest 
in many embodiments is the use of the subject vectors to target vertebrate animals. 



10 
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pahicularly avian and marine animals, e.g., chickens, zebrafish, and the like; mammalian 
animals, including murine, porcine, ovine, equine, rat, dog, cat, monkey, humans;, and the 
like. 

, In .the methods of the subject invention, a P element vector as described above is 
5 introduced into or administered to (e.g., injected into, fed to, etc.) a whole animal or 

directly to. specific tissue(s) and/or organ(s) under conditions sufficient for excision of the 
P feet flanked nucleic acid from the vector and subsequent integration of the excised 
nucleic acid into the whole animal genome, as defined above, i.e., into the genomic DNA 
of a least a portion of the cells making up the whole animal. As the P element vector is 

10 introduced into the animal "under conditions sufficient for excision and integration to 
occur," the subject method further includes a step of ensuring that the requisite 
trahsposase activity is present along with the introduced vector. Depending on the 
structure of the vector itself, i.e., whether or not the vector includes a region encoding a 
pro;duct having P element transposase activity, the method may further include 

1 5 intf oducing a second vector into the animal which encodes the requisite transposase 
activity, i 

Thie vector (and second vector where necessary) may be introduced into the 
animal using any convenient protocol, where the protocol may provide for in vivo 
introduction of the vector. In vivo protocols that find use in delivery of the subject vectors 

20 inqlude delivery via lipid based, e.g. liposome vehicles, where the lipid based vehicle may 
be -targeted to a specific cell type for cell or tissue specific delivery of the vector,. Patents 
di^losing such methods include: U.S. Patent Nos. 5,877,302; 5,840,710; 5,830,430; and 
5,&27,703, the disclosures of which are herein incorporated by reference. Other in vivo 
delivery systems may also be employed, including: the use of polylysine based peptides 

25 as carriers, which may or may not be modified with targeting moieties, microinjection, 
electroporation, and the like. (Brooks, A.I., et al. 1998, J. neurosci. Methods V. 80 p: 137- 
47;,Murartatsu, T., Nakamura, A., and H.M. Park 1998, Int. J. Mol. Med. V. 1 p: 55-62). 
Because of the multitude of different types of vectors and delivery vehicles that may be 

employed,^ administration may be by a number of different routes, where representative 

V 
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routes of administration incliide: oral, topical, intraarterial, intravenous, intraperitoneal, 
« 

intramuscular, etc. The particiilar mode of administration depends, at least in part, on the 
nature of the delivery vehicle Employed for the vectors. In many embodiments, the vector 
or vectors are administerejl intravascularly, e.g. intraarterially or intravenously, 
5 employing an aqueous based delivery vehicle, e.g. a saline solution. 

The amount of vector nucleic acid that is introduced into the animal is sufficient to 
provide for the desired excision and insertion of the exogenous nucleic acid into the 
genome. As such, the. amount of vector nucleic acid introduced should provide for a 
sufficient amount of transposase activity and a sufficient copy number of the exogenous 

10 nucleic acid. The amount of vector nucleic acid that is introduced into the animal varies 
depending on the efficiency of the particular introduction or transfection protocol that is 
employed. , Si 

Following introduction of the vector DNA into the animal in combination with the 
requisite transposase, !the nuclfeic acid region of the vector that is flanked by the P feet of 

1 5 the vector, i.e., the vefctor nucjeic acid positioned between the P element transposase 
recognized 31 base pair terminal repeats, is excised fi-om the vector via the provided 
transposase and inserted into the genome of the animal. As such, introduction of the 
vector DNA into the animal is followed by subsequent transposase mediated excision and 
insertion of the exogenous nucleic acid carried by the vector into the genome of the 

20 animal. [t 

Because of th^ particular properties of the subject transposon based vectors, the 
subject vectors may b'le used t6 integrate large pieces of DNA into the genome of the 
whole animal. Generally, the size of DNA that the vectors insert into the genome is at 
least about 50 bp, usually at l^ast about 1000 bp and more usually at least about 2000 bp, 

25 where the size may be as large as 1 50,000 bp or larger. Where the vector inserts a large 
piece of DNA into the genome, the size of the inserted DNA ranges from about 30 to 150 
kb. ■ I 

The subject methods qf stable integration of exogenous nucleic acid into the 
genome of a target anjmal find use in a variety of applications in which the stable 
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integration of an ologenous nucleic acid into the genome of an animal is desired. 
Applications in which the subject vectors and methods find use include: research 
applicatioris, polypeptide synthesis applications and therapeutic applications. Each of 
these reprejsentativQ categories of applications is described separately below in greater 
5 detail. f 

Research Applications 

Examples of research applications in which the subject vectors find use include 

10 applicatioiis desigried to characterize a particular gene. In such applications, the vector is 
employed to insert a gene of interest into an animal and the resultant effect of the inserted 
gene on the animaUs phenotype is observed. In this manner, information about the gene's 
activity and the natiire of the product encoded thereby can be deduced. The vectors can 
also be employed td identify and define DNA sequences that control gene expression, e.g. 

15 in a temporal (e.g. pertain developmental stage) or spatial (e.g. particular cell or tissue 
type) manrier. In suth assays, the subject vectors are employed to stably integrate into the 
genome ofa animal a selectable marker gene, e.g. antibiotic resistance, LacZ, etc., where 
the vector lacks a sufficient promoter for the marker the gene such that the marker is not 
significantly expressed, if at all, unless it is underneath an endogenous promoter element. 

20 If the marker gene is inserted into the animal's genome in sufficient relationship to an 

endogenous promoter sequence, it will be expressed. From the resultant expression profile 
of the marker gene, the endogenous promoter that is mediating its expression can then be 
characterized. Yet another research application in which the subject vectors find use is in 
the identification and characterization of the resuhs of gene expression studies. For 

25 example, a plurality of animals with distinct integrated vectors are prepared in which the 
gene of interest is inserted into distinct locations in the genome, where expression of the 
gene of interest is dependent on endogenous promoter mediation, i.e. where the gene of 
interest lacks a promoter or is coupled to only a weak promoter. By plurality is meant at 
least two, where the' number usually ranges from about 2 to 20,000, usually from about 2 
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to 200. This plurality of vector targeted animals may be produced by introducing the 

vector in aiplurality of animals or taking a collection of pretargeted animals that are 

homogenous with respect to the insertion site of the gene, i.e. progeny of a single targeted 

animal, an^ then introducing transposase into one or more of the constituent members of 

5 the collection. The^subject vectors can also be used to study integration mutants, where 

the vector is inserted randomly into the genome and disrupts a random gene whose 

identity and phenotype can be quickly characterized using standard techniques. One can 

also employ the subject vectors to produce models in which overexpression and/or 

misexpression of a gene of interest is produced in an animal and the effects of this mutant 
■« — 
10 expression pattem are observed. One can also use the subject vectors to readily clone 

genes introduced into a host animal via insertional mutagenesis that yields phenotypes 

and/or expression patterns of interest. In such applications, the subject vectors are 

employed to generate insertional mutants through random integration of DNA, The 

phenotype and/or expression pattem of the resultant mutant is then assayed using any 

1 5 convenient: protocol. In those mutants of interest, cloning of the DNA associated with the 

phenotype and/or expression pattem of interest is readily accomplished through use of the 

i 

P-feet of the subject vector. 



20 



Polypeptide Synthesis Applications 



In addition to the above research applications, the subject vectors also find. use in 
the synthesis of polypeptides, e.g. proteins of interest. In such applications, a vector that 
includes a gene encoding the polypeptide of interest in combination with requisite and/or 
desired expression tegulatory sequences, e.g. promoters, etc., (i.e. an expression module) 
25 is introduced into an appropriate animal to serve as an expression host for expression of 
the polypeptide (e.g., mouse, chicken, goat, and the like). FoUowdng introduction and 
subsequent -stable integration into the animal's genome, the animal is then maintained 
under any necessary conditions (e.g., inducible system) sufficient for expression of the 
integrated gene. Onbe the transformed host expressing the protein is prepared, the protein 
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10 



is then purified to produce the desired protein comprising composition. Any convenient 
protein purification procedures may be employed, where suitable protein purification 
methodologies are described in Guide to Protein Purification, (Deuthser ed.) (Academic 
Press, 1990). For exajnple, the protein could be purified fi-om the milk of a goat with the 
appropriate attached signal sequences for expression and secretion into the milk. Then a 
lysate may be prepared from the milk and the protein further purified using HPLC, 
exclusion chr6matography, gel electrophoresis, affinity chromatography, and the like. 

Therapeutic Applications 



The subject vectors also find use in therapeutic applications, in which the vectors 
are employed 'to stably integrate a therapeutic nucleic acid, e.g. gene, into the genome of a 
animal, i.e. gqne therapy applications. The subject vectors may be used to deliver a wide 
variety of therapeutic nucleic acids. Therapeutic nucleic acids of interest include genes 

15 that replace defective genes in the target animal, such as those responsible for genetic 

defect based diseased conditions; genes which have therapeutic utility in the treatment of 
cancer; and thie like. Specific therapeutic genes for use in the treatment of genetic defect 
based diseaselconditions include genes encoding the following products: factor VIII, 
factor IX, P-globin, low-density protein receptor, adenosine deaminase, purine nucleoside 

20 phosphorylas^, sphingomyelinase, glucocerebrosidase, cystic fibrosis transmembrane 
regulator, a-antitrypsin, CD-I 8, ornithine transcarbamylase, arginosuccinate synthetase, 
phenylalanine hydroxylase, branched-chain a-ketoacid dehydrogenase, 
fumarylacetoacetate hydrolase, glucose 6-phosphatase, a-L-fiicosidase, P-glucuronidase, 
a-L-iduronid^se, galactose 1 -phosphate uridyltransferase, and the like. Cancer therapeutic 

25 genes that may be delivered via the subject vectors include: genes that enhance the 
antitumor activity of lymphocytes, genes whose expression product enhances the 
immunogenicity of tumor cells, tumor suppressor genes, toxin genes, suicide genes, 
multiple-drug resistance genes, antisense sequences, and the like. Because of the length of 
nucleic acid that can b& carried by the subject vectors, the subject vectors can be used to 
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not only introduce a therapeutic gene of interest, but also any expression regulatory 
elements, such as promoters, and the like, which may be desired so as to obtain the 
desired temporal and spatial expression of the therapeutic gene. 



Kits 



Also provided are kits for use in practicing the subject methods. The subject kits at 

least include a vector described above having an expression module sufficiently proximal 
I 

to,a P-foot or a means for making the same, e.g., a provector having restriction 

10 endonuclease site, e.g., multiple cloning site, proximal to a P-foot and, optionally, a 

corresponding restriction endonuclease(s). The subject kits may further include other 

components that find use in the subject invention, e.g., buffers, delivery vehicles, etc. 

; The various components of the kit may be present in separate containers or certain 

compatible components may be precombined into a single container, as desired. 

15 In addition to the above components, the subject kits will further include 

instructions for practicing the subject invention. These instructions may be present in the 

subject kits in a variety of forms, one or more of which may be present in the kit. One 

form in which these instructions may be present is as printed information on a suitable 

medium or substrate, e.g., a piece or pieces of paper on which the information is printed, 
I 

20 inlthe packaging of the kit, in a package insert, etc. Yet another means would be a 

computer readable medium, e.g., diskette, CD, etc., on which the information has been 
recorded. Yet another means that may be present is a website address which may be used 

i 

via the internet to access the information at a removed site. Any convenient means may be 
present in the kits. 

25 : 

The following examples are offered by way of illustration and not by way of 
limitation. . 
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: EXPERIMENTAL 

1. P Element Cohtaining Vectors 

5 Description of Vectors used-The nucleic acid vectors used for this experimental 

section were based wholly or in part (restriction digested and subcloned DNAs) on the 
pCasper4, pBluescriptll, pcdna3.1, and delta2-3 (or related transposase producing vector). 
The constructs made were as follows: Restriction enzymes were used to replace the white 
gene in the pCasper4 Vector with the CMV-bgal and SV40-neomycin resistance genes and 

10 this new construct will be referred to as C3. 1 . The next construct was created by removing 
the CMV-bgal gene from the C3.1 vector and this construct will be referred to as Cneo. 
Next the bgal gene and S V40 promoter were removed linking the CMV promoter to the 
neo resistance gene and this construct was called Ccmvneo. Next, the SV40-neo gene was 
removed from C3.1 and this construct was called Cbgal. Next, the CMV-bgal gene was 

15 replaced in the C3.1 vector with pBluescriptll and this construct was called Cbsneo. 
Finally, pBluescript w&s added into C3.1, between the CMV promoter and the 3'P foot 
and this construct was called Gbs3. 1 . 

i 

Name txn units size genes promoters position of promoters 



Vector 


Size 


Transcriptional 
Units' 


Promoters 


Genes 


Promoter 

Position 


Casper4 




0 


N/A 


N/A 


N/A 


C3.1 


1 


2 


CMV, SV40 


Bgal, neo' 


CMV near 
Pfoot 


Cneo 




1 


SV40 


neo' 


SV40 near 
Pfoot 


Ccmvneo 




1 


CMV 


neo' 


CMV near 
Pfoot 


Cbgal 


I 


1 


CMV 


Bgal 


CMV near 
Pfoot 
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Cbsneo ' 


7 

;^ 


1 


SV40 


neo' 


None near 
Pfoot 


Cbs3.1 


i 


2 


CMV, SV40 


Bgal, neo' 


None near 
P foot 



A digram oteach of the above vectors is provided in Figures 1 A to IG. 



II. Integration in animals is dependent upon concentrations of transposase and 
5 P element Vector. 

A. Strategy: Animals are co-injected with C3.1 and the transposase producing 
vectors. Th^ injectipn empoyed is a systemic tail vein injection. Different mice receive 
different concentrations of each vector in order to determine a concentration that enables 

10 the C3.1 veetor to integrate into the genome of the mouse. Concurrently, mice are also 
injected witli just the transposase vector at the highest concentration (3 |ig) to determine 
the time coijrse for |egredation. After degradation of the transposase producing vector 
any integrations are?stable in the genome of the mouse. Analyses of the genomic DNA 
from these ipice is then made. 

i : 

15 ; 

B. Exp^riment::;Male mice were injected twice, with a week in between each 
injection. Tho monihs after the second injection, genomic DNA was harvested from a tail 
preparation jfor eacli- mouse and analyzed. PGR assessment never detected the transposase 
vector. The jPGR assessment of the G3.1 vector integrating into the genome occurred in a 

20 concentration deperMent manner. That is, the more of the vector that integrates or the 
more of the vector that enables the integration in the injection mixture gave a stronger 
positive resuh (more integrations into the genome of the mouse). The results are provided 
in the following table. 



25 
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Mouse Number 

i 


Transposase Vector 
(^g) 


C3.1 
(Ktg) 


PGR Results 
(Transposase/C3 . 1 ) 


' 1 


0.5 


1 


-/- 


2 ) 


1 


1 


-/- 


■ 3 : 


3 


1 


-/- 


, 4 ^ 


0.5 


3 


-/- 


! 5 


1 


3 


-/- 


: 6 . 


3 


3 


-/+ 


: 7 .. 


0.5 


5 


-/+ 


; 8 ? 


1 


5 


-/++ 


; 9 


3 


5 


-/++ 


: 10 


0.5 


10 


-/++ 


' 11 i 


1 


10 


-/++ 


■ 12 


3 


10 


-/++ 



10 



15 



no band 
+ weak band 
strong band 

Rats have also been examined for the integration of the C3.1 vector and the results are 
as seen above, modifying the amount of DNA based on the weight difference between 
a mouse anii rat. 

III. . Integration occurs in every cell type, tissue type and organ examined 

I i 

Mquse #8 Was used as the best positive with the least amount of DNA injected to 
determine the ^tent of gene delivery that occurred in the mouse. That is, we harvested a 
wide variety of tissues from the mouse and extracted the genomic DNA to ascertain the 
breadth of tissues that could have gene delivery via the C3.1 vector. The C3.1 vector was 
robustly detected in genomic DNA from every tissue examined, some of which included 
testis, liver, spleen, heart, lung, brain, intestine. This has been repeated with the same 
results on female mice. 



20 
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IV. Integratec^ vector^^ are heritable 

Mice were injected as the #8 mouse in the table above and mated as described in 
the table below. A}1 litters tested had at least one transgenic offspring. Southern blots and 
5 PGR analyses of the offspring indicated that typically 1 to 3 integrations occur per 
haploid genome for the injected concentration. The integrated DNA has been stable 
through four generations in mice. Analogous results were generated using rat matings. In 
addition, targeted injection into the specific organ and tissue type produces non-systemic 
integration. For example, injection into the testis directly showed heritablity, but no 
1 0 integrations were detected putside of the testis. 



Mating 


% Transgenic 
Progeny (N, 
#independent liters) 


Male 


Female 


Time Post Injection 




Injected x2 ' 


No Injection 


2. weeks 


25 (12, 2) 


Injected xl 


Injected xl 


l-2days 


40 (20, 4) 


Injected xl 


Injected xl 


2 weeks 


38(13, 2) 


Injected xl 


Injected xl 


5 weeks 


71(14, 2) 


Injected xl 


Injected xl 


1 yr. 


44(9,2) 


Control Injected xf 


Control Injected xl 


l-2days 


0(19, 3) 



V. One actively t^anscril^ed gene in proximity to at least one of the P feet is essential 
15 for animal integration! 

The functional elements for integration into animals were examined. The two aspects 
that were important are that at least transcriptional active promoter is in the vector and 
that it is located near to a P foot. Though not wishing to be bound by any particular 
20 theory, this fmding!signifiei^ that the DNA binding aspect of the promoter brings 

transcriptional factors into .proximity of the P foot. As transcriptional factors relax the 
compaction of DNA in ord^r to allow protein access, this may be the key step for 
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transpisase excision and integration. That is, any mechanism that relaxes the DNA in 
proximity to the P foot is likely to enable the transposase to integrate the P element into 
the genome of an animal. 



Vector 


PGR results 


Casper4 


- 


C3.1 




Cneo 


+ 


Ccmvneo 




Cbgal 




Cbsneo 




Cbs3.1 





VI. ; Genomes from avian, mice, rats, humans do not have sequences that are 
sinlilar to the P feet 

PGR assessment for the prescence and absence of the 5T foot and 3T foot were in 
part used to detect integration of the vector into the genome of animals. During these 
experiments, it became clear that the hallmark sequences of the P element (the P feet) are 
not naturally present in non-fruitfly animals as no 5T or 3T feet sequences were detected 
in genomic DNA isolated from birds, mice, rats, and people. 

VIL Integration of a large DNA fragments into a target cell 

P element vectors as described above have the capability to mobilize up to 150 kb 
of DNA . This presents the opportunity of the P element carrying a DNA sequence with 
therapeutic use of similar length into a target cell. The manipulation and handling of DNA 

fragments up to 4 megabases is difficult but has been reproducibly achieved. For 

i 
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example, this vector could allow^the potential for analyzing genes on the size of the 

i ■ * 

human huntington gene 50 - 200 kb). 

It is evident fromithe above results and discussion that the subject invention 
5 provides a novel nucleic 'acid vector having a broad range of applications. Advantages of 
the subject vector over other known nucleic acid vectors include the ability to integrate 
long stretches of nucleic acid into the genome of a population of target cells of a whole 
animal, i.e., to integrate tjhe nucleic acid into the genome of a whole animal. This feature 
is advantageous for a nurpber of reasons, including the ability to integrate an exogenous 

1 0 gene along with its nativ^ expreslion regulatory elements. Another feature of the subject 
invention is that the vectors provide for random insertion of a foreign nucleic acid, which 
is desirable in many applications' Yet another advantage of the subject invention is that 
the subject vectors do not elicit host immune reactions, in contrast with many viral based 
vectors. In addition, the subject viectors do not carry risk of viral infection or 

1 5 recombination hazards. As such, the subject vectors provide a number of advantages over 
other vectors currently kriown an^ employed in the art, and therefore represent a 
significant contribution t(p the art.^ 

All publications and patent applications cited in this specification are herein 
20 incorporated by reference as if eath individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. The citation of 
any publication is for its disclosure prior to the filing date and should not be construed as 
an admission that the preknt invention is not entitled to antedate such publication by 
virtue of prior invention. » 
25 Although the foregoing invention has been described in some detail by way of 

illustration and example for purppses of clarity of understanding, it is readily apparent to 
those of ordinary skill in the art iii light of the teachings of this invention that certain 
changes and modifications may hh made thereto without departing from the spirit or scope 
of the appended claims. [ 

y ; 22 

i 

B, F & F Reference: TOSK-OOT 
F:\DOCUMENT\tosk\007\patent application.doc 



